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Abstract 

Starting from the Hamilton-Jacobi equation describing a classical 
ensemble, one may infer a quantum dynamics using the principle of 
maximum uncertainty. That procedure requires an appropriate mea- 
sure of uncertainty: Such a measure is constructed here from phys- 
ically motivated constraints. It leads to a unique single parameter 
extension of the classical dynamics that is equivalent to the usual lin- 
ear quantum mechanics. 



1 Deconstructing the Schrodinger equation 

Despite its remarkable quantitative success, quantum mechanics continues to 
puzzle us with its seemingly counter-intuitive predictions. Even the mathe- 
matical formalism most widely used for its description appears very different 
from that used in classical mechanics: one sees in quantum mechanics the 
appearance of complex numbers, probability amplitudes and an apparently 
exact linear evolution equation. 
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In this paper the structure of Schro dinger's equation, in particular its 
hnearity, will be derived within an information theoretic framework to be 
elaborated on below. The various assumptions involved in the derivation 
will also be discussed at length. 

Let us begin with a review of the Schrodinger equation for particles in 
d + 1 dimensions, 

• \ 

ihip = — —9ijdidj + V 



ij (1) 



where i,j = 1,2, ,dN and the metric is defined as Qij = Sij/m(^i-^ with 

the symbol (i) defined as the smallest integer > i/d. That is, i = l,...d, 

refer to the coordinates of the first particle of mass mi, i = d + 1, 2d, 

to those of the second particle of mass 1712 and so on. The overdot refers 
to a partial time derivative and the summation convention is used unless 
otherwise stated. Cartesian coordinates have been chosen as these allow 
an unambiguous correspondence between observables such as momenta and 
their operator representation yj. 

The metric Qij occurs naturally in the description of the system in con- 
figuration space 13 Ej and plays a crucial role in the discussion below. It is 
pertinent to note that the metric g^j is diagonal and positive-definite. This 
is a consequence of the form of the kinetic term in the Schrodinger equation 
in Cartesian coordinates. 

Since our intuition is mostly classical, it is useful to rewrite the Schrodinger 
equation in a form which allows comparison with Newtonian physics. The 
Madelung transformation |1] = e'^^^^ decomposes the Schrodinger equa- 
tion into two real equations. 



S+f4Sa,S + V'-|,,».M<P) =0. (2) 



p + g,jd,(pajS) = 0. (3) 

The first equation is a generalisation of the usual Hamilton- Jacobi equation, 
the term with explicit h dependence (the "quantum potential" PP) sum- 
marising the peculiar and nonlocal aspects of quantum theory. The second 
equation is a continuity equation expressing the conservation of probability, 
/ p{x, t)dx^^. 

Equations ((21 El) may be obtained from a variational principle [HI, one 



2 



minimises the action 



/ 



P S +^^^S^,S + V dx^'^dt + —If 
I 2 J 8 



(4) 



with respect to the variables p and S. Interestingly, the quantity 



/ 



dx dt gij p {di \ogp) {dj \ogp) 



(5) 



resembles the "Fisher information" [S', whose inverse sets a lower bound 
on the variance of the probability distribition p{x) through the Cramer- Rao 
inequality jHl IHI- Since a broader probability distribution p{x) represents 
a greater uncertainty in x, the term Ip is actually an inverse uncertainty 
measure. 

The equations (jHEI) were used in Ref.0[7] to derive Schrodinger's equa- 
tion through a procedure analogous to the principle of maximum entropy 
(uncertainty) |HJ E] used in statistical inferrence theory. The idea is that 
without the term Ip, variation of Eq.(jD) gives rise to equations describing 
a classical ensemble. As the probability distribution p{x) characterising the 
ensemble is supposed to represent some fluctuations of unknown origin, we 
would like to be as unbiased as possible in its choice. This is achieved by 
choosing the broadest distribution possible, representing our maximum un- 
certainty. Technically, this is implemented in (@)) by minimising (0) when 
varying the classical action: fi^/S is the Lagrange multiplier. 

2 Constructing the Measure 

However, the intriguing approach of Ref. ^2, does not explain, a priori, the 
form of the information measure that should be used. That is, why must Ip 
be minimised rather than something else? 

The goal of this section is to construct, from first principles, information 
measures that are permissible. To fix the notation, consider therefore the 
same classical ensemble as in Sec.(l), but now constrained by a general in- 
formation measure I whose form is to be determined. The relevant action is 
then 




p S + ^diSdjS + V dx^'^dt 



(6) 
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with A a Lagrange multiplier. Varying this action will give rise, in general, to 
a nonlinear Schrodinger equation after an inverse Madelung transformation, 

V> + (7) 

with F representing the nonlinearity. 

In order to construct an explicit form for the information measure /, 
constraints need to be imposed. These constraints are of two types. The first 
type are those that are required for / to be sensibly interpreted as a measure 
of inverse uncertainty (information). The condition [SI] below belongs to 
this type. 

The other constraints [S2\ — [5*6] to be discussed below are of a differ- 
ent type. One may adopt two alternative perspectives to motivate these 
constraints. The first, classical, perspective is to view the action Q as a 
generalised form of classical dynamics. In that case a minimal deformation 
of the usual classical dynamics is achieved if the additional constraints are 
the same as those already satisfied by the J = part of locality, ho- 
mogeneity, separability, Gallilean invariance, and absence of more than two 
derivatives in each product of terms in the action. Thus by using [S2] — [SQ] 
one is not imposing any constraints on the action that do not already exist. 
In this classical perspective the physical motivations for the constraints are 
either the usual ones with an obvious interpretation (locality, separability, 
Gallilean invariance), or else they are explained below. 

The second, quantum, perspective is to view the conditions [S2] — [SQ] 
with respect to the interpretation of Eq.((7j), which represents a generalised 
Schrodinger equation. As the usual linear quantum mechanics has been ex- 
perimentally well tested, one could argue that it makes sense to only consider 
those potential deformations of the linear theory that permit as much of the 
usual interpretations of the wave-function as possible. Remarkably, as dis- 
cussed below, the same conditions [5*2] — [SQ] motivated by the classical 
perspective are needed also for this quantum viewpoint. 

Here then are the axioms: 

• [5*1] (i) Firstly, by definition, the measure I should be a real-valued and 
positive definite functional for allp = il^'^ip. More specifically, we would 
like the measure to be universal in the sense of being independent of 
the external potential V. 

(ii) Also, the interpretation of / as an information (inverse uncertainty) 
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measure requires that it should approach a minimum when p is uni- 
formly distributed. (A minimum exists because by (i) / is positive 
definite). 

• [5*2] Locality: / should be of the form I = J dx^'^dt pH{p) where H 
is a function of the probability p{x, t) and and its spatial derivatives. 
In the quantum perspective this local form ensures the validity of the 
weak superposition principle in the equations of motion ((7j): states with 
negligible overlap will not infiuence each other strongly. (In principle 
one should also allow / to depend on x and time derivatives. These 
generalisations are discussed in the next section.) 

• [5*3] Homogeneity: H should be invariant under scaling, H{\p) = 
H{p). The normalisation of probability, 1 = / dx^'^p{x,t), implies that 
the dimensions of p{x, t) depend on the dimensions of the configuration 
space. Thus by demanding H to be scale invariant one ensures that 
the resulting equations of motion, whether in the classical or quantum 
perspective, have a universal form independent of the number of par- 
ticles. Thus this condition may be viewed as restricting the search to 
universal dynamics. 

(There is another motivation sometimes given for this homogeniety con- 
dition: It allows solutions of ((Tj) to be (re) normalised and thus allow 
for the usual interpretation of states after a measurement process, as 
discussed in ITU].) 

• [54] Separability: H should be separable for the case of two inde- 
pendent sub-systems described by probability distributions pi and p2: 
H{p = P1P2) = H{pi) + H{p2). 

• [55] H should be Gallilean invariant^. 

• [5*6] H should not contain more than two derivatives in any product 
of terms that appears in it. As each derivative involves an inverse 
length, this condition obviously restricts the number of new dimensional 
parameters that can appear in (j6l7j) . This condition will be referred to 
in brief as "absence of higher number of derivatives" or "AHD" . 

As will be apparent later, the implementation of this condition means 

^In this paper, when discussing the symmetry of the Schrodinger equation one will 
always refer to the case of vanishing potential, F = 0. 
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that the lagrange multipher A in (jUl), and hence Planck's constant, 
is the only new parameter that is required in making the transition 
from classical to quantum mechanics. Conversely, relaxing the AHD 
condition would imply the appearance of other parameters, with the 
dimensions of length, and a generalised form of quantum dynamics. 

1 would like to reiterate that the conditions [52] — [5*6] above are all already 
satisfied by the classical part of the action so demanding them of the 
additional piece / is actually quite natural and minimalist. The additional 
motivations provided by viewing the equations from the quantum perspective 
are simply a bonus. 

Having enumerated and motivated the axioms, one may begin construct- 
ing the measure. Clearly the homogeneity requirement [S3] cannot be sat- 
isfied if H depends only on p : it must also contain derivatives of p. The 
AHD condition and rotational invariance imply that the building blocks of 
H must be 

gijUidiU2djU3 and gijVididjV2, (8) 

where the f/j, Vi are functions of p. Separability can now be used to deduce 
that H must be linear in Qij: 

H = g,, {UidM,U3 + Vid,d,V2) . (9) 

One may consider sums of such structures and so place an additional index 
n on the Ui,Vi but it is easy to check that the final result below remains 
unchanged. One could also use separability to restrict the explicit forms 
that U, V may take but I will use the homogeneity condition for that purpose 
below. (Note that if the form of H{p) were restricted so that derivatives of 
p occured only in polynomial form, as is commonly done in physics, then 
separability would not be required in obtaining (jH))). 
Using the chain rule one can rewrite Q as 

H = g,, (^^[U,U'2U',p' + V.V^p'] + ^[FiF^'p]) , (10) 

where the prime symbol denotes a derivative with respect to p. Consider now 
the scaling p —>■ Xp under which H is required to be invariant. The terms 
inside the square brackets become dependent only on the product Xp. Since 
the forms ^'^f^^ and are distinct and independently scale invariant, this 
means that the terms in square brackets must also be independent of A: but 
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since those terms depend only on Ap, this imphes that the terms in square 
brackets are simply constants. 
Thus one obtains 

/ = / dtdx^^g,, [a^-^ + . (11) 

The "5" term gives a surface contribution which might not vanish for some 
wavef unctions and so its contribution to / is of indefinite sign. The positivity 
and universality of I therefore requires us to choose B = 0. 

Hence one concludes that the unique solution of the conditions [SI] — 
[5*6] is the information measure Ip given in (0). The Lagrange multiplier 
A in © must then have the dimension of (action)^ thereby introducing the 
Planck constant into the picture; the equation of motion is then the linear 
Schro dinger equation. 

Interestingly, neither the second part of [5*1] nor the full Gallilean in- 
variance was used explicitly in the above construction eventhough the final 
result, the measure Ip, does satisfy all the conditions. However these addi- 
tional constraints will be useful in the next section. 

It should be noted that, as shown in jTTj, the action (0} increases for 
variations of p that keep S fixed, the increase being due to an increase in 
If, so that the resulting solutions are not just an extremum but do indeed 
minimise the information measure Ip- 

The positivity condition in [5*1] plays an important physical role beyond 
ensuring the existence of a minimum for I (state of maximum uncertainty). 
It also guarantees that the following energy functional is bounded from below 
for potentials V that are likewise bounded: 

£[S,p] = I d^'^x p {^d,Sd,S + V + \H{p)^ (12) 

where the function H is defined in [5*2]. This functional is conserved for 
stationary states and it also reduces to the average of the usual quantum 
mechanical hamiltonian for the case of the linear theory. These properties of 
the energy functional qualify it as the most natural to use for defining the 
energy of the system in a potential generalisation (nonlinear) of quantum 
theory. 

The meaning of the AHD condition can be elucidated with an explicit 
example. Consider 

Hi{,p) = gijdiiXogp + T]f{p))dj{\ogp + r]f{p)) (13) 
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with f{p) = gki{dklogp){di\ogp) and r] a constant. This Hi satisfies all 
the constraints except AHD. The price to pay is the appearance of an addi- 
tional parameter, rj, required to balance the dimensions of the higher order 
derivatives. 

Thus the AHD condition ensures that, within the information theoretic 
approach, the Schrodinger equation is the unique single parameter extension 
of the classical statistical Hamilton- Jacobi equations. Since the information 
theoretic approach attempts to provide an unbiased description of data, one 
may say that the AHD condition further restricts us to the simplest unbiased 
description. 

3 Relaxing some conditions 

It has been implicitly assumed in the last section that the metric Qij that 
appears in / is the same as the metric in the classical part of the action. If 
one allows in the measure a metric Qij which is still diagonal but different 
from the classical metric Qij then a nonlinear Schrodinger equation apparently 
ensues. However the nonlinearity can be removed by a change of variables 
(a nonlinear gauge transformation) fT^ IT^ with the result that for a range 
of values of the Lagrange multiplier A one actually recovers the usual linear 
Schrodinger equation. (This example highlights the point that a nonlinear 
Schrodinger equation cannot immediately, and in all generality, be declared 
pathological). However for the remaining values of A one obtains after the 
change of variables ^H] a linear diffusion equation. It will be assumed for 
the rest of this paper that the classical metric is used also in the information 
measure when symmetries are preserved. 

Consider now allowing I and hence H to depend also on the phase factor 
S{x, t) in addition to the probability density p{x, t). By definition, the phase 
factors 5*1 and 5*2 for two independent systems are additive in the composite 
system, 5* = Si + 82- One can proceed as before and consider the most 
general structures restricted by rotational invariance, AHD, homogeneity, 
separability and positivity. The result is a generalised measure of the form 

Jq = y dtdx^'^ pgijdi{ai logp + a2S)dj{ai logp + 028), (14) 

where the ak are constants. In arriving at this structure, homogeneity has 
only been used to imply that derivatives of p occur, while separability is the 
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stronger constraint as it acts also on the S variable. However the second part 
of the condition [5*1] requires that 02 = 0, thus eventually one is again led 
to the Fisher form. (Again, more generally one may place another index n 
on the ak and sum over such terms but the conclusion remains unchanged.) 

Nevertheless, the special case of (fT^ with ai = but a2 7^ is sufficiently 
interesting to deserve further study because like the classical measure Ip, it 
is positive definite by itself, but unlike Ip it also contains some information 
about the phase of the wavefunction. Now, if used in the variational action, 
this S dependent term can be absorbed, after a scahng of the metric, by a 
similar term already existing in the classical part of the action (jH)). The net 
result is therefore still a linear Schrodinger equation but with a mass, m(j) 
which is renormalised with respect to the original mass parameter m(j) in the 
classical theory. Empirically this renormalisation will have no consequence 
if all calculations, as usual, refer to the mass parameter appearing in the 
quantum theory. 

One may also consider allowing time derivatives of p and 5* in H. However 
the demands of positivity confiict with those of separability unless one relaxes 
the AHD condition: then it is possible to have structures such as 

H2{p) = 9ijdi (^ogp + dj (^ogp + (15) 

that contain dimensionfull parameters. 

Finally, one may consider an explicit dependence on the coordinates, Xi, 
in H. However such terms are ruled out by translational invariance. 



4 Conclusion 

If one adopts the philosophy that the laws of physics should be constructed so 
as to provide the most economical and unbiased representation of empirical 
facts, then the principle of maximum uncertainty jHl E] is the natural avenue 
by which to investigate the foundations of quantum theory [7| E] . 

The investigation here has extended the initiative of [3] in several ways. 
Firstly, the constraints that a relevant information measure should satisfy 
have been made explicit and motivated from two alternative perspectives. 
Secondly, the measure has been constructed from the constraints rather than 
postulated, thus motivating the structure of the linear Schrodinger equa- 
tion. Indeed, it has been shown here that within the information theoretic 
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approach, the hnear Schrodinger equation is the unique one-parameter ex- 
tension of the classical dynamics. 

One should compare the approach here with an alternative, axiomatic, 
but not information theoretic based, construction of the Schrodinger equa- 
tion from classical mechanics discussed in Starting from the classical 
Hamilton- Jacobi equations, the authors add constrained fluctuations to the 
kinetic term. The result is an equation similar to Eq.® above with an ex- 
plicitly postive definite H{p) and with the symmetries of the classical action. 
However, in addition to the major differences in motivation and epistemol- 
ogy, there is a significant technical contrast between this paper and Ref . [H] : 
there the authors postulate and use an "exact uncertainty relation" instead 
of the homogeneity condition [S3] adopted here. As discussed above, the 
homogeneity condition is already satisfied by the classical action and has the 
interpretation of making the form of the equations of motion independent of 
the dimension of configuration space, and thus it is a simple restriction to 
universal dynamics. 

An open and interesting problem is to extend the constructive approach 
adopted in Sec. (2) to include spin [15^ and relativistic effects [2]. This might 
involve further refinement of the conditions in Sec. (2) and might result in 
the use of more general information measures such as Iq of eq. ()14|) . 

Finally, one may enquire into the possible generalisations of quantum me- 
chanics (typically non-linear) that result from omitting one or more of the 
conditions [SI] — [S6] and the ensuing phenomenological consequences. These 
issues are discussed elsewhere |16J (iTj. It is of interest to note that one of 
the earliest suggestions for a nonlinear Schrodinger equation pjj had a \ogp 
nonlinearity, which is allowed if the homogeneity condition is abandoned. 
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